HydroZIP: How Hydrological Knowledge can Be Used to Improve Compression of Hydrological Data
نویسندگان
چکیده
From algorithmic information theory, which connects the information content of a data set to the shortest computer program that can produce it, it is known that there are strong analogies between compression, knowledge, inference and prediction. The more we know about a data generating process, the better we can predict and compress the data. A model that is inferred from data should ideally be a compact description of those data. In theory, this means that hydrological knowledge could be incorporated into compression algorithms to more efficiently compress hydrological data and to outperform general purpose compression algorithms. In this study, we develop such a hydrological data compressor, named HydroZIP, and test in practice whether it can outperform general purpose compression algorithms on hydrological data from 431 river basins from the Model Parameter Estimation Experiment (MOPEX) data set. HydroZIP compresses using temporal dependencies and parametric distributions. Resulting file sizes are interpreted as measures of information content, complexity and model adequacy. These results are discussed to illustrate points related to learning from data, overfitting and model complexity.
منابع مشابه
Data compression to define information content of hydrological time series
When inferring models from hydrological data or calibrating hydrological models, we are interested in the information content of those data to quantify how much can potentially be learned from them. In this work we take a perspective from (algorithmic) information theory, (A)IT, to discuss some underlying issues regarding this question. In the information-theoretical framework, there is a stron...
متن کاملPredicting the ungauged basin: model validation and realism assessment
The hydrological decade on Predictions in Ungauged Basins (PUB) led to many new insights in model development, calibration strategies, data acquisition and uncertainty analysis. Due to a limited amount of published studies on genuinely ungauged basins, model validation and realism assessment of model outcome has not been discussed to a great extent. With this paper we aim to contribute to the d...
متن کاملFuture climate change impact on hydrological regime of river basin using SWAT model
Hydrological components in a river basin can get adversely affected by climate change in coming future. Manipur River basin lies in the extreme northeast region of India nestled in the lesser Himalayan ranges and it is under severe pressure from anthropogenic and natural factors. Basin is un-gauged as it lies in remote location and suffering from large data scarcity. This paper explores the imp...
متن کاملHydrological Drought Forecasting Using Stochastic Models (Case Study: Karkheh watershed Basin)
Hydrological drought refers to a persistently low discharge and volume of water in streams and reservoirs, lasting months or years. Hydrological drought is a natural phenomenon, but it may be exacerbated by human activities. Hydrological droughts are usually related to meteorological droughts, and their recurrence interval varies accordingly. This study pursues to identify a stochastic model (o...
متن کاملStatistical downscaling of GRACE gravity satellite-derived groundwater level data
With the continued threat from climate change, population growth and followed by increasing water demand, the need for hydrological data with high spatial resolution and proper time coverage to be felt more than ago. Therefore, having data such as terrestrial water storage changes and groundwater level changes with high resolution spatial helps to plan and make decisions for water resource mana...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Entropy
دوره 15 شماره
صفحات -
تاریخ انتشار 2013